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A new regression model search algorithm was developed in 2011 that may 
be used to analyze both general multivariate experimental data sets and wind 
tunnel strain gage balance calibration data. The new algorithm is a simplified 
version of a more complex search algorithm that was originally developed at 
the NASA Ames Balance Calibration Laboratory. The new algorithm has the 
advantage that it needs only about one tenth of the original algorithm’s CPU 
time for the completion of a search. In addition, extensive testing showed that 
the prediction accuracy of math models obtained from the simplified algorithm 
is similar to the prediction accuracy of math models obtained from the original 
algorithm. The simplified algorithm, however, cannot guarantee that search 
constraints related to a set of statistical quality requirements are always satisfied 
in the optimized regression models. Therefore, the simplified search algorithm 
is not intended to replace the original search algorithm. Instead, it may be 
used to generate an alternate optimized regression model of experimental data 
whenever the application of the original search algorithm either fails or requires 
too much CPU time. Data from a machine calibration of NASA’s MK40 force 
balance is used to illustrate the application of the new regression model search 
algorithm. 


Nomenclature 


AF = axial force 

N 1 = normal force at the forward normal force gage of the balance 

N 2 = normal force at the aft normal force gage of the balance 

R1 = electrical outputs of the forward normal force gage 

R2 = electrical outputs of the aft normal force gage 

i?3 = electrical outputs of the forward side force gage 

i?4 = electrical outputs of the aft side force gage 

R5 = electrical outputs of the rolling moment gage 

R6 = electrical outputs of the axial force gage 

RM = rolling moment 

51 = side force at the forward side force gage of the balance 

52 = side force at the aft side force gage of the balance 


I. Introduction 

Between 2005 and 2009 a regression model search algorithm was developed for the NASA Ames Balance 
Calibration Laboratory. The search algorithm identifies regression models of multivariate experimental data 
sets that meet strict statistical quality requirements and prevent “over fitting” of the dependent variable 
(see, e.g., Refs. [1] to [5] for a description of different aspects of the original search algorithm). The original 
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search algorithm was implemented in NASA’s BALFIT software package (see Ref. [6]). Currently BALFIT 
and the search algorithm are applied on a regular basis to wind tunnel strain-gage balance calibration data 
sets at both NASA Ames Research Center and NASA Langley Research Center. 

Experience showed that the original regression model search algorithm is not always able to successfully 
complete a search. The failure of the search is usually associated with the detection of a discontinuity in the 
algorithm’s search metric. Several causes for the failure of a regression model search were identified. Most 
of them were related (i) to limitations of the regression model search algorithm itself or (ii) to shortcomings 
and imperfections in experimental data sets. Some of the limitations of the original search algorithm were 
removed by giving a user of the algorithm a greater choice of (i) math term group combinations, (ii) search 
constraint options, and (iii) search directions. In addition, improvements of BALFIT’s analysis report format 
were made so that shortcomings in experimental data sets can more easily be spotted. 

In 2010, after analyzing problems associated with some wind tunnel strain-gage balance calibration 
data, it was concluded that the application of the original search algorithm is also sometimes unsuccessful 
because tare corrections have to be applied to some balance data sets before the regression model search 
can be started (see Ref. [7] for a detailed description of the tare load iteration process). Unfortunately, an 
accurate tare correction assessment can often only be obtained if a very good estimate of the final optimized 
regression model of the balance calibration data is already known ( “chicken-and-egg” problem). Therefore, 
the author attempted to simplify several elements of the original regression model search algorithm such 
that the CPU time needed for the search would be significantly reduced while still enforcing the three search 
constraints of the original search algorithm. These efforts lead to the development of the simplified regression 
model search algorithm that is discussed in the present paper. 

The most important elements of the simplified regression model search algorithm are briefly summarized 
in the next section of the proposed conference paper. Then, data from the machine calibration of a wind 
tunnel strain-gage balance is used to assess (i) the CPU time needed by the simplified algorithm for the 
completion of a regression model search and (ii) the predictive capability of its optimized regression models. 

II. Simplified Search Algorithm 

The simplified regression model search algorithm was derived from a more complex algorithm that was 
developed for the NASA Ames Balance Calibration Laboratory between 2005 and 2010 (see Refs. [1] to [4] 
for details about the original more complex algorithm). The development of the simplified search algorithm 
became necessary because some wind tunnel strain-gage balance calibration data sets require tare corrections 
before the original more complex search algorithm can be applied. Accurate tare corrections, however, can 
often only be obtained if already a good “guess” of the final optimized regression model is available before 
the beginning of the search (chicken-and-egg problem). This “guess”, in order to be useful, has to be 
determined within a fraction of the CPU time that is needed for the original more complex search process. 
The simplified search algorithm fulfills all these requirements. 

The simplified search algorithm has matured since it was first developed in 2011. It is now included in 
NASA’s BALFIT software package as a new “Math Model Type” choice that an analyst can make (i.e. , the 
“Math Model Type” choice “Suggested Math Model”). The simplified search algorithm uses many of the 
statistical quality metrics and search constraints that the original search algorithm applies (see Ref. [5] for 
a general discussion of different metrics that may be used to evaluate a regression model of experimental 
data). However, the metrics and constraints are applied in a simplified fashion in order to greatly reduce 
the total number of math models that have to be tested during the regression model search. 

Figure 1 summarizes basic elements of the simplified regression model search algorithm. The search 
starts by first selecting a combination of math term groups (function classes) that help define an upper 
bound for the regression model search. The math term group combination is usually selected by using 
subject matter knowledge about the given experimental data set. The chosen combination, e.g., may consist 
of linear terms, cross-product terms, square terms, and absolute value terms. Not every term of the chosen 
math term groups may be supported by the given experimental data set. Therefore, the upper bound, i.e., 
the Permitted Math Model, needs to be determined for the regression model search such that only non- 
singular solutions of the related least squares problem exist. This is accomplished by using a numerical 
technique called Singular Value Decomposition (SVD). In the next step, the variance inflation factors of 
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the terms of the Permitted Math Model are inspected to make sure that the upper bound does not have 
unwanted near-linear dependencies. Terms are typically omitted in the Permitted Math Model that exceed 
the literature recommended variance inflation factor threshold of 10. It is important to point out that the 
simplified search algorithm removes terms such that the Hierarchy Rule is still satisfied. In other words, 
each term of the Permitted Math Model will still have all lower order terms after the omission of terms that 
violate the variance inflation factor threshold. In the next step, the revised upper bound for the simplified 
search is applied to the experimental data set and the p-values of the t-statistic of each regression model 
coefficient are computed. Now, “insignificant” terms, i.e., terms that have a p-value of greater than 0.0001 
are removed in order to obtain the final optimized regression model. Again, the term removal is performed 
such that the Hierarchy Rule is maintained in the final optimized regression model. 

A more detailed comparison of the simplified and more complex regression model search algorithms will 
be included in the final manuscript of the proposed conference paper. This discussion will also compare the 
CPU time needed for the regression model search for different math term group combinations and other 
search parameter that the simplified and more complex search algorithm have in common. In the next 
section of the extended abstract a data example is discussed to illustrate different aspects of the application 
of the simplified regression model search algorithm. 

III. Discussion of Example 

Data from the calibration of the NASA Ames MK40 force balance was chosen to illustrate steps needed 
for the application of the simplified regression model search algorithm. The MK40 is a wind tunnel strain- 
gage balance that was originally manufactured by the Task Corporation. It is a six-component force balance 
that measures five forces and one moment (-/VI, N2 , 51, 52, AF, RM). The balance has a diameter of 
2.5 inches and a total length of 17.31 inches. Table 1 shows the load capacity of each load component. 

Table 1: Load capacities of the NASA Ames 2.5in MK40 balance. 



iVl, lbs 

N 2, lbs 

51, lbs 

52, lbs 

RM, in-lbs 

AF, lbs 

CAPACITY 

3500 

3500 

2500 

2500 

8000 

400 


The balance calibration data was obtained in 2008 in Triumph Aerospace’s balance calibration machine. 
The supplied machine calibration data was already corrected for the weight of the balance shell and other 
calibration fixtures. Therefore, the tare load iteration process was omitted during the analysis of the data. 
Table 2 below summarizes important characteristics of the balance and its calibration data set. 


Table 2: Balance and calibration data set characteristics of the MK40 balance. 


BALANCE NAME 

MK40 

BALANCE TYPE 

FORCE BALANCE (TASK DESIGN) 

DIAMETER 

2.5 [in] 

CALIBRATION DATE 

JULY 2008 

CALIBRATION METHOD 

MACHINE CALIBRATION 

TOTAL NUMBER OF CALIBRATION POINTS 

1863 

LOAD FORMAT 

TARE CORRECTED LOADS 

GAGE OUTPUT FORMAT 

GAGE OUTPUT DIFFERENCES 


First, in order to start the regression model search, it was necessary to define an upper bound, i.e., the 
Permitted Math Model . The balance is a multi-piece balance (TASK design). Therefore, as the data is to 
be processed using the Iterative Method recommended in Ref. [7], it was decided to include absolute value 
terms in the math term group combination for the regression models of the six gage outputs of the balance. 
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The following groups were chosen for the regression model of the gage outputs 


F, in F ■ F, F-\F\, F-G 

where F and G are symbols representing the load components of the balance. The balance calibration 
data set was obtained in a calibration machine that supports all load combinations that can be constructed 
from the five chosen math term groups. Therefore, after including the intercept in the regression models of 
the gage outputs, the initial upper bound for the search has a total of 40 terms for each component. The 
corresponding regression models of the gage outputs R1 to R6 are given in Fig. 2. 

Specific knowledge about the characteristics of a TASK balance may be used to reduce the number of 
regression model terms in the initial upper bound that is given in Fig. 2. In a recent paper it was shown that 
not all gages of a TASK balance have the bi-directional characteristic that traditionally justifies the selection 
of absolute value terms for a TASK balance (see Ref. [8]). Figure 3a shows, for example, the bi-directionality 
of the MK40 balance. It can clearly be seen that the rolling moment gage R5 and the axial force gage R6 
are not bi-directional (cf. last two plots in Fig. 3a). Consequently, there is no justification for the use of 
the terms |AF| and RM in the regression models of the gage outputs of the MK40 balance. Then, after 
removing the related terms from the initial upper bound, a new 36-term upper bound of the math model is 
obtained that is shown in Fig. 3b. 

An initial analysis of the variance inflation factors of the 36-term upper bound for the regression model of 
the forward normal force gage output R1 is shown in Fig. 3c. A near-linear dependency between the absolute 
value terms of the normal/side force components and corresponding quadratic terms appears to exist as terms 
of those two function classes have variance inflation factors that exceed the literature recommended threshold 
of 10 (see red boxes in Fig. 3c). The connection between the absolute value and quadratic terms comes from 
the fact that both the absolute value function and the quadratic function are even. The variance inflation 
factor analysis for the MK40 data set indicates that either the absolute value terms or the quadratic terms of 
the normal/side force components can be omitted in the regression models. A TASK balance is, by design, 
a very linear device. Therefore, it was decided to omit the quadratic terms of the normal and side force 
components. The corresponding second revision of the upper bound for the regression model search is shown 
in Fig. 3d. 

Now, the original and the simplified search algorithms can be applied to the MK40 data set by using 
the math model shown in Fig. 3d as the upper bound for the regression models of the gage outputs. Fig- 
ure 4a shows the optimized models of the gage outputs that were obtained after applying the original search 
algorithm. It took approximately 28 minutes of CPU time to complete the search. Figure 4b shows the cali- 
bration load residuals of the six load components that are obtained after applying the data reduction matrix 
derived from the optimized models of Fig. 4a to the gage outputs. Figure 5a shows the optimized models of 
the gage outputs that were obtained after applying the simplified search algorithm. It took approximately 3 
minutes of CPU time to complete the search. Figure 5b shows the calibration load residuals of the six load 
components that are obtained after applying the data reduction matrix derived from the optimized models 
of Fig. 5a to the gage outputs. 

It is interesting at this point to directly compare the standard deviation of the load residuals that can 
be obtained for the five different regression model sets discussed above. Table 3 below lists the standard 
deviation as a percentage of the capacity for the six load components. 

Table 3: Standard deviation of the load residuals in percent of the load capacity. 


MATH MODEL 

Nl 

N 2 

51 

52 

RM 

AF 

Fig. 2 

0.1008 % 

0.0967 % 

0.2115 % 

0.1762 % 

0.1780 % 

0.1354 % 

Fig. 3b 

0.1101 % 

0.1032 % 

0.2402 % 

0.1822 % 

0.2011 % 

0.1709 % 

Fig. 3d 

0.1207 % 

0.1063 % 

0.2456 % 

0.1931 % 

0.2016 % 

0.1760 % 

Fig. 4a 

0.1216 % 

0.1065 % 

0.2473 % 

0.1986 % 

0.2055 % 

0.1808 % 

Fig. 5a 

0.1229 % 

0.1075 % 

0.2481 % 

0.1942 % 

0.2065 % 

0.1782 % 
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Overall, the standard deviation of the load residuals compare very well for the five regression model 
sets. The comparison of the standard deviations also illustrates that differences between the predictive 
capability of the regression models of the original seach algorithm, i.e., Fig. 4a, and the regression models 
of the simplfied seach algorithm, i.e., Fig. 5a, are very small (less than 0.005 % of load capacity). 

IV. Conclusions 

A new simplified regression model search algorithm was described. The new algorithm was derived from 
a more complex search algorithm that was originally developed at the NASA Ames Balance Calibration 
Laboratory. The new simplified search algorithm tries to rapidly identify a regression model for a given 
experimental data set that meets strict statistical quality requirements and simultaneously prevents “over- 
fitting” of the data. The simplified search algorithm has the advantage that it requires only about one 
tenth of the original algorithm’s CPU time for the completion of a regression model search. However, the 
simplified search algorithm cannot guarantee that the final optimized regression model will always meet the 
chosen statistical quality requirements. Therefore, the simplified search algorithm is intended to be used for 
the generation of an alternate optimized regression model of the given data whenever the application of the 
original search algorithm either fails or requires too much CPU time. In addition, the simplified algorithm 
may be used to get better initial estimates of tare corrected loads of wind tunnel strain-gage balance 
calibration data. These improved estimates are needed as input whenever the original search algorithm is 
applied to strain-gage balance calibration data that is not yet corrected for the weight of the calibration 
body and other calibration hardware components. 

Machine calibration data of NASA’s MK40 six-component force balance is used to illustrate the applica- 
tion of the simplified search algorithm to a strain-gage balance data set. First, it is explained how a suitable 
upper bound for the regression model search can be defined for the chosen balance calibration data set. New 
information about the bi-directionality characteristics of the individual gages of a TASK balance is used for 
this purpose. Therefore, all absolute value terms of the rolling moment and axial force and the quadratic 
terms of the normal and side force components are omitted in the regression models of the gage outputs. 
Then, both the original and the simplified algorithm are applied to the calibration data set using the upper 
bound to limit the regression model search. This analysis demonstrated that the prediction accuracy, i.e., 
the standard deviation of the load residuals, shows excellent agreement whenever results for the optimized 
models of the original search algorithm are compared with results for the optimized models of the simplified 
search algorithm. The simplified algorithm, however, needed only about one tenth of the original algorithm’s 
CPU time for the regression model search. 
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EXPERIMENTAL DATA SET 


SELECT MATH TERM GROUP OR FUNCTION CLASS COMBINATION 
(NEEDED FOR DEFINITION OF THE PERMITTED MATH MODEL ) 



USE SVD TO DETERMINE THE PERMITTED MATH MODEL 
(LARGEST NON-SINGULAR MATH MODEL FOR GIVEN DATA SET) 



USE PERMITTED MATH MODEL TO INITIALIZE OPTIMIZED MATH MODEL 
( OPTIMIZED MATH MODEL IS ALSO CALLED PRELIMINARY MATH MODEL ) 



DETERMINE UPPER BOUND OF OPTIMIZED MATH MODEL BY ITERATIVELY 
SATISFYING THE CHOSEN VARIANCE INFLATION FACTOR THRESHOLD 



MAKE UPPER BOUND "HIERARCHICAL" BY ADDING 
MISSING LOWER ORDER TERMS TO MATH MODEL 



DETERMINE P-VALUES OF T-STATISTIC OF ALL 
COEFFICIENTS OF THE "HIERARCHICAL" UPPER BOUND 



REMOVE TERMS OF UPPER BOUND THAT VIOLATE P-VALUE THRESHOLD 
- REMOVAL IS PERFORMED FROM HIGHER TO LOWER ORDER TERMS 
- TERM IS ONLY REMOVED IF MATH MODEL REMAINS HIERARCHICAL 



OPTIMIZED MATH MODEL 


Fig. 1 Basic elements of the simplified regression model search algorithm. 
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NUMBER OF TERMS - 40, 40, 40, 40, 40, 40 

HIERARCHICAL: R1, R2, R3, R4, R5, R6 
(HIERARCHY ANALYSIS USES IF*GI = iFklGl, lF*F*Fl= IFUIFI* lift, IFI«IFI=F»F) 



R1 

R2 

R3 

R4 

R5 

R6 

INTERCEPT 







N2 







SI 







S2 







RM 







AF 







In i l 







IN2I 







IS1 1 







IS2I 







IRMI 







lAFl 







N1*N1 







N2*N2 







S1*S1 







S2*S2 







RM*RM 







AF*AF 







N1*IN1 1 







N2*lN2l 







SI *IS1 1 







S2*lS2l 







RMflRMl 







AF*lAFl 







N1 »N2 







N1 *S1 







N1 *S2 







N1*RM 







N1*AF 







N2*S1 







N2*S2 







N2*RM 







N2*AF 







SI *S2 







SI *RM 







SI *AF 







S2*RM 





H 

H 

S2*AF 







RM*AF 







IN 1 *N2 1 

□ 

m m m m m 

IN1*S1 1 

□ 

m m m m m 

lN1*S2l 

□ 

m m m m m 

INHRMI 

□ 

m m m m m 

IN1*AFI 

□ 

m m m m m 

IN2*S1 1 

□ 

m m m m m 

lN2*S2l 

□ 

m m m m m 

lN2*RMl 

□ 

m m m m m 

lN2*AFl 

□ 

m m m m m 

IS1 *S2l 

□ 

m m m m m 


R1 R2 R3 R4 R5 


IS 1 *RMl 

m m m m m 

IS 1 *AFl 

m m m m m 

IS2*RK/ll 

m m m m m 

IS2*AFl 

m m m m m 

|RM*AFI 

m m m m m 

N1 *IN2I 

m m m m m 

N 1 * IS 1 1 

m m m m m 

N1*IS2I 

m m m m m 

N 1 * IRM 1 

m m m m m 

N1*IAFI 

m m m m m 

N2 * IS 1 1 

m m m m m 

N2*IS2I 

m m m m m 

N2*lRMl 

m m m m m 

N2*lAFl 

m m m m m 

S1*IS2I 

m m m m m 

SI* IRM 1 

m m m m m 

SI* lAFl 

m m m m m 

S2*lRMl 

m m m m m 

S2* lAFl 

m m m m m 

RM.IAFI 

m m m m m 

INI |*N2 

m m m m m 

Ini I*si 

m m m m m 

INI l*S2 

m m m m m 

INI l*RM 

m m m m m 

Ini I*af 

m m m m m 

lN2l*S1 

m m m m m 

lN2l*S2 

m m m m m 

IN2URM 

m m m m m 

IN2I+AF 

m m m m m 

IS1 l*S2 

m m m m m 

IS1 l*RM 

m m m m m 

IS1 l*AF 

m m m m m 

IS2l*RM 

m m m m m 

IS2l*AF 

m m m m m 

IRMUAF 

m m m m m 

N1 *N 1 *N1 

m m m m m 

N2*N2*N2 

m m m m m 

SI *S1 *S1 

m m m m m 

S2*S2*S2 

m m m m m 

RM*RM*RM 

m m m m m 

AF*AF*AF 

m m m m m 

IN 1 *N 1 *N 1 1 

m m m m m 

1 N2*N2*N2 1 

m m m m m 

IS1*S1*S1 1 

m m m m m 

lS2*S2*S2l 

m m m m m 

|RM*RM*RMI 

m m m m m 

lAF*AF*AFl 

m m m m m 


Fig. 2 Initial permitted math model of the MK40 calibration data set. 
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A(R6,AF), microV/V A(R5,RM), mlcroV/V A(R4,S2), microV/V A(R3,S1), microV/V A(R2,N2), microV/V A(R1 ,N1), microV/V 


THRESHOLO = ±5.9 [microV/V] ; INDICATOR VARIABLE AT CAPACITY = 15.7 [microV/V] > Bl- DIRECTIONAL 


4 ^ ?sn ^T v 'JDO Alpjiuii THRESH0LD ( s 

EE FOOTNOTE) n . L 

THRESHOLD (S 

EE FOOTNOTE) 


-3500. 0. 3500. 

N1, lbs 


THRESHOLD = ±7.2 [microV/V] ; INDICATOR VARIABLE AT CAPACITY = 23.5 [microV/V] > Bl- DIRECTIONAL 



-3500. 0. 3500. 

N2, lbs 


THRESHOLD = ±6.2 [microV/V] ; INDICATOR VARIABLE AT CAPACITY = 28.0 [microV/V] > Bl- DIRECTIONAL 


^^^OeSQ^j-THRrsHOLD (S 

EE FOOTNOI^g^^ 1 ®^^® 

THRESHOLD (S 

EE FOOTNOTE) 


-2500. 0. 2500. 

SI, lbs 


THRESHOLD = ±6.7 [mfcroV/V] ; INDICATOR VARIABLE AT CAPACITY = 32.7 [microV/V] > Bl- DIRECTIONAL 



-2500. 0. 2500. 

S2, lbs 


THRESHOLD = ±6.6 [microV/V] ; INDICATOR VARIABLE AT CAPACITY = -8.44e-01 [microV/V] > NOT Bl- DIRECTIONAL 



-8000. 0. 8000. 

RM, in-lbs 


THRESHOLD = ±7.4 [microV/V] ; INDICATOR VARIABLE AT CAPACITY = -2.5 [microV/V] > NOT BI-DIRECTIONAL 


THRESHOLD (S 

EE FOOTNOTE) 


^ A A 

y -o-o co oo -oo-o — oo <x> -o . h 

w-v o ^oo-oe> -oob — oo <30^0- o -c 

THRESHOLD (S 

EE FOOTNOTE) 


-400. 0. 400. 

AF, lbs 


Fig. 3a Indicator variable for bi-directionality of the MK40 balance. 
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NUMBER OF TERMS = 36, 36, 36, 36, 36, 36 

HIERARCHICAL: R1, R2, R3, R4, R5, R6 



(HIERARCHY 

ANALYSIS 

USES 

lF»Gl = IFI* IGI, 

IF*F*F| — IFI* 

IFUIFI, 

IFUIFI 

— F*F) 
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□ 
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□ 
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□ 

□ 
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□ 

□ 
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□ 

□ 

□ 

□ 
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□ 
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□ 
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□ 
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□ 

□ 
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□ 

□ 

□ 

□ 

□ 
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□ 

□ 
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□ 
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□ 
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□ 
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□ 
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□ 

□ 

□ 

N2*N2 







SI* IS2 1 

□ 

□ 

□ 

□ 

□ 

S1*S1 







SI *IRMI 

□ 

□ 

□ 

□ 

□ 

S2*S2 







SI* lAFl 

□ 

□ 

□ 

□ 

□ 

RM*RM 







S2*lRMl 

□ 

□ 

□ 

□ 

□ 

AF*AF 







S2* lAFl 

□ 

□ 

□ 

□ 

□ 

N1*lN1 1 







RM*lAFI 

□ 

□ 

□ 

□ 

□ 

N2*lN2l 







Ini i*n2 

□ 

□ 

□ 

□ 

□ 

S1*IS1 1 







INI l*S1 

□ 

□ 

□ 

□ 

□ 

S2*lS2l 







INI l*S2 

□ 

□ 

□ 

□ 

□ 

RM*lRMl 

□ 

□ 

□ 

□ 

□ 

□ 

Ini I*rm 

□ 

□ 

□ 

□ 

□ 

AF* lAFl 

□ 

□ 

□ 

□ 

□ 

□ 

Ini i*af 

□ 

□ 

□ 

□ 

□ 

N1 *N2 







lN2 l*S 1 

□ 

□ 

□ 

□ 

□ 

N1 *S1 







IN2 l*S2 

□ 

□ 

□ 

□ 

□ 

N1 *S2 







lN2l*RM 

□ 

□ 

□ 

□ 

□ 

N1 *RM 







lN2l*AF 

□ 

□ 

□ 

□ 

□ 

N1*AF 







IS1 l*S2 

□ 

□ 

□ 

□ 

□ 

N2+S1 







IS1 l*RM 

□ 

□ 

□ 

□ 

□ 

N2*S2 







IS1 l*AF 

□ 

□ 

□ 

□ 

□ 

N2*RM 







IS2l*RM 

□ 

□ 

□ 

□ 

□ 

N2*AF 







IS2UAF 

□ 

□ 

□ 

□ 

□ 

S1*S2 







lRMl*AF 

□ 

□ 

□ 

□ 

□ 

S1*RM 







N1*N1*N1 

□ 

□ 

□ 

□ 

□ 

SI *AF 







N2*N2*N2 

□ 

□ 

□ 

□ 

□ 

S2*RM 







SI *S1 *S1 

□ 

□ 

□ 

□ 

□ 

S2*AF 







S2»S2*S2 

□ 

□ 

□ 

□ 

□ 

RM*AF 







RM*RM*RM 

□ 

□ 

□ 

□ 

□ 

IN 1 *N2l 

□ 

□ 

□ 

□ 

□ 

□ 

AF*AF*AF 

□ 

□ 

□ 

□ 

□ 

IN 1 *S1 1 

□ 

□ 

□ 

□ 

□ 

□ 

IN 1 *N 1 *N 1 1 

□ 

□ 

□ 

□ 

□ 

1 N 1 *S2 1 

□ 

□ 

□ 

□ 

□ 

□ 

lN2*N2*N2l 

□ 

□ 

□ 

□ 

□ 

INI *RMl 

□ 

□ 

□ 

□ 

□ 

□ 

IS1 *S1 *S1 1 

□ 

□ 

□ 

□ 

□ 

IN 1 *AFl 

□ 

□ 

□ 

□ 

□ 

□ 

IS2*S2*S2 1 

□ 

□ 

□ 

□ 

□ 

1 N2*S 1 1 

□ 

□ 

□ 

□ 

□ 

□ 

lRM*RM*RMl 

□ 

□ 

□ 

□ 

□ 

lN2*S2l 

□ 

□ 

□ 

□ 

□ 

□ 

lAF*AF*AFl 

□ 

□ 

□ 

□ 

□ 

1 N2*RM 1 

□ 

□ 

□ 

□ 

□ 

□ 







lN2*AFl 

□ 

□ 

□ 

□ 

□ 

□ 







IS 1 *S2 1 

□ 

□ 

□ 

□ 

□ 

□ 








Fig. 3b First revision of the permitted math model after removal of terms related to \AF\ and \RM\. 
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□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□□ 


PHYSICAL VARIABLES & UNITS - REGRESSION COEFFICIENT ESTIMATES AND STATISTICAL METRICS (R1) 

REGRESSION MODEL HIERARCHY CHARACTERISTICS = HIERARCHICAL 


TERM 

TERM 

COEFFICIENT 

STANDARD 

T— STATISTIC OF 

P -VALUE OF 

VI F 

VI F 

INDEX 

NAME 

VALUE 

ERROR 

COEFFICIENT 

COEFFICIENT 

(PRIMARY) 

(ALTERNATE) 

1 

INTERCEPT -1.3915 

+ 0.0724 

-19.2079 

- 

- 

- 

2 

N1 

+ 0.3435 

+ 0.0001 

+3218.6915 

< 0.0001 

+3.1535 

1+1 1 .1063] 

3 

N2 

-0.0081 

+0.0001 

-73.9053 

< 0.0001 

+ 2.7146 

[+17.9385] 

4 

SI 

+ 0.0002 

+ 0.0001 

+ 1.5301 

+ 0.1262 

+ 2.6630 

[+17.7705] 

5 

S2 

+ 0.0002 

+ 0.0001 

+ 1.6099 

+ 0.1076 

+ 3.1455 

[+16.5158] 

6 

RM 

+ 0.0005 

+ 1.2244e-05 

+40.4820 

< 0.0001 

+ 1.2300 

+ 1.01 16 

7 

AF 

+ 0.0009 

+ 0.0003 

+3.5283 

+ 0.0004 

+ 1.0939 

+ 1 .0052 

8 

INI 1 

-0.0004 

+ 0.0002 

-2.2613 

+0.0239 

+ 16.6365 

[+16.1960] 

9 

IN2I 

+ 0.0021 

+ 0.0001 

+ 15.7623 

< 0.0001 

+ 18.6616 

[+18.6444] 

10 

IS1 1 

-0.0016 

+ 0.0003 

-6.0404 

< 0.0001 

+ 24.21 12 

[+24.2108] 

1 1 

IS2I 

-0.0003 

+ 0.0002 

-1.4333 

+0.1519 

+ 24.0623 

[+24.0623] 

14 

N1*N1 

+ 1 .2580e-06 

+ 7.0086e-08 

+ 17.9486 

< 0.0001 

+ 17.4855 

f+16.6215] 

15 

N2*N2 

— 1.3589e-08 

+ 5.061 7e-08 

-0.2685 

+ 0.7884 

+ 17.4322 

[+17.4844] 

16 

S1*S1 

-9.7985e-08 

+ 1.3672e-07 

-0.7167 

+0.4737 

+ 18.8256 

[+18.8276] 

17 

S2*S2 

+ 5.1 81 0e-08 

+ 1.1 008e-07 

+0.4707 

+ 0.6379 

+ 17.8914 

[+17.8914] 

18 

RM*RM 

-1.31 1 7e— 07 

+ 2.2 1 72e— 09 

-59.1601 

< 0.0001 

+ 1.1603 

+ 1.1606 

19 

AF*AF 

+ 2.6983e-06 

+ 8. 1 051 e-07 

+3.3291 

+0.0009 

+ 1.0432 

+ 1.0432 

20 

N 1 * IN 1 

+ 1 .2456e-06 

+ 5.1 676e-08 

+ 24.1036 

< 0.0001 

+ 1.3995 

[+1 1.4671] 

21 

N2-IN2I 

+ 1 .6883e— 07 

+ 4.51 20e— 08 

+3.7419 

+ 0.0002 

+ 2.6652 

[+18.2346] 

22 

SWISH 

-3.3343e-07 

+ 9.3268e-08 

-3.5750 

+0.0004 

+ 1.9403 

[+1 1.9933] 

23 

S2*IS2I 

-2.4281 e-07 

+ 8.6584e-08 

-2.8043 

+ 0.0051 

+ 2.3194 

[+15.1907] 

26 

N 1 *N2 

-1.4454e-07 

+ 2.2080e-08 

-6.5461 

< 0.0001 

+ 1.5788 

+ 1 .5701 

27 

N 1 *S1 

-1.1 806e— 06 

+ 6. 1 998e-08 

-19.0419 

< 0.0001 

+ 1.7335 

+ 1.5130 

28 

N1»S2 

+ 2.2981 e-08 

+ 5.5697e-08 

+ 0.4126 

+ 0.6799 

+ 1.7532 

+ 1 .5240 

29 

N 1 *RM 

-3.0356e-07 

+ 1 .9 1 96e-08 

-15.8139 

< 0.0001 

+ 1.2554 

+ 1 .0358 

30 

N1*AF 

-1.6866e-07 

+ 2.8800e-07 

-0.5856 

+0.5582 

+ 1.1 131 

+ 1.0233 

31 

N2*S1 

+ 1.1 083e— 07 

+4.9463e-08 

+ 2.2408 

+ 0.0252 

+ 1.51 10 

+ 1.5093 

32 

N2*S2 

- 1 .0591 e-07 

+ 4.4563e-08 

-2.3766 

+ 0.0176 

+ 1.5228 

+ 1 .5204 

33 

N2*RM 

+ 1.3387e— 08 

+ 1.4845e-08 

+0.9018 

+ 0.3673 

+ 1 .0344 

+ 1.0323 

34 

N2*AF 

+ 9.7052e— 08 

+ 2.4748e— 07 

+0.3922 

+ 0.6950 

+ 1.0199 

+ 1 .0227 

35 

S1*S2 

+ 3.2285e-08 

+ 5.0522e-08 

+0.6390 

+ 0.5229 

+ 1.8467 

+ 1 .8467 

36 

SI *RM 

-8.8355e-07 

+ 2.6275e-08 

-33.6266 

< 0.0001 

+ 1.4044 

+ 1 .4048 

37 

SI *AF 

— 3.8047e— 07 

+ 4. 1 734e— 07 

-0.91 17 

+ 0.3621 

+ 1.3182 

+ 1 .3183 

38 

S2*RM 

-8.6731 e-08 

+ 2.5668e-08 

-3.3790 

+ 0.0007 

+ 1.4104 

+ 1 .4108 

39 

S2.AF 

+4.6004e— 09 

+ 3.8899e-07 

+0.01 18 

+0.9906 

+ 1.3182 

+ 1.3183 

40 

RM*AF 

-3.4890e-08 

+ 1 .239 1 e-07 

-0.2816 

+ 0.7783 

+ 1.0000 

+ 1 .0001 


Fig. 3c Variance inflation factors of the math model of gage output R1 for the revised permitted math model. 
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NUMBER OF TERMS = 32, 32, 32, 32, 32, 32 

HIERARCHICAL: R1, R2, R3, R4, R5, R6 



(HIERARCHY 

ANALYSIS 

USES 

IF»GI = lFl*IGI, 

IF+F+F = IFI* 

IFI.IFI, 

IFI.IFI 

= F*F) 




R1 

R2 

R3 

R4 

R5 

R6 


R1 

R2 

R3 

R4 

R5 

R6 

INTERCEPT 







IS 1 *RMl 

□ 

□ 

□ 

□ 

□ 

□ 

N1 







IS 1 .AFl 

□ 

□ 

□ 

□ 

□ 

□ 

N2 







IS2.RMI 

□ 

□ 

□ 

□ 

□ 

□ 

SI 







IS2*AFl 

□ 

□ 

□ 

□ 

□ 

□ 

S2 







|RM*AFI 

□ 

□ 

□ 

□ 

□ 

□ 

RM 







N1.IN2I 

□ 

□ 

□ 

□ 

□ 

□ 

AF 







N 1 * IS 1 1 

□ 

□ 

□ 

□ 

□ 

□ 

In i 1 







N1.IS2I 

□ 

□ 

□ 

□ 

□ 

□ 

IN2I 







NUlRMl 

□ 

□ 

□ 

□ 

□ 

□ 

IS1 1 







Nl.lAFI 

□ 

□ 

□ 

□ 

□ 

□ 

IS2I 







N2*lS1 1 

□ 

□ 

□ 

□ 

□ 

□ 

IRMI 

□ 

□ 

□ 

□ 

□ 

□ 

N2*lS2l 

□ 

□ 

□ 

□ 

□ 

□ 

lAFl 

□ 

□ 

□ 

□ 

□ 

□ 

N2*lRMl 

□ 

□ 

□ 

□ 

□ 

□ 

N 1 *N 1 

□ 

□ 

□ 

□ 

□ 

□ 

N2*lAFl 

□ 

□ 

□ 

□ 

□ 

□ 

N2*N2 

□ 

□ 

□ 

□ 

□ 

□ 

SUIS2I 

□ 

□ 

□ 

□ 

□ 

□ 

SI *S1 

□ 

□ 

□ 

□ 

□ 

□ 

SI »lRMl 

□ 

□ 

□ 

□ 

□ 

□ 

S2*S2 

□ 

□ 

□ 

□ 

□ 

□ 

SI *IAFI 

□ 

□ 

□ 

□ 

□ 

□ 

RM.RM 







S2*lRMl 

□ 

□ 

□ 

□ 

□ 

□ 

AF*AF 







S2*lAFl 

□ 

□ 

□ 

□ 

□ 

□ 

N 1 * IN 1 1 







RM.IAFI 

□ 

□ 

□ 

□ 

□ 

□ 

N2*lN2l 







Ini I*n2 

□ 

□ 

□ 

□ 

□ 

□ 

S1*IS1 1 







INI l*S1 

□ 

□ 

□ 

□ 

□ 

□ 

S2*lS2l 







INI l*S2 

□ 

□ 

□ 

□ 

□ 

□ 

RM.lRMl 

□ 

□ 

□ 

□ 

□ 

□ 

INI l*RM 

□ 

□ 

□ 

□ 

□ 

□ 

AF.lAFl 

□ 

□ 

□ 

□ 

□ 

□ 

INI l+AF 

□ 

□ 

□ 

□ 

□ 

□ 

N1 *N2 







IN2US1 

□ 

□ 

□ 

□ 

□ 

□ 

N1»S1 







lN2l*S2 

□ 

□ 

□ 

□ 

□ 

□ 

N1*S2 







IN2I+RM 

□ 

□ 

□ 

□ 

□ 

□ 

N1*RM 







lN2l*AF 

□ 

□ 

□ 

□ 

□ 

□ 

N1.AF 







IS1 l*S2 

□ 

□ 

□ 

□ 

□ 

□ 

N2*S1 







IS1 l*RM 

□ 

□ 

□ 

□ 

□ 

□ 

N2*S2 







IS1 l*AF 

□ 

□ 

□ 

□ 

□ 

□ 

N2*RM 







lS2l*RM 

□ 

□ 

□ 

□ 

□ 

□ 

N2*AF 







IS2UAF 

□ 

□ 

□ 

□ 

□ 

□ 

S1*S2 







IRMI.AF 

□ 

□ 

□ 

□ 

□ 

□ 

SI *RM 







N1*N1*N1 

□ 

□ 

□ 

□ 

□ 

□ 

SI *AF 







N2*N2*N2 

□ 

□ 

□ 

□ 

□ 

□ 

S2*RM 







SI *S1 *S1 

□ 

□ 

□ 

□ 

□ 

□ 

S2*AF 







S2*S2»S2 

□ 

□ 

□ 

□ 

□ 

□ 

RM*AF 







RM.RM.RM 

□ 

□ 

□ 

□ 

□ 

□ 

Ini *N2i 

□ 

□ 

□ 

□ 

□ 

□ 

AF.AF.AF 

□ 

□ 

□ 

□ 

□ 

□ 

IN1.S1 1 

□ 

□ 

□ 

□ 

□ 

□ 

IN1.N1*N1 1 

□ 

□ 

□ 

□ 

□ 

□ 

Ini *S2l 

□ 

□ 

□ 

□ 

□ 

□ 

|N2*N2*N2 

□ 

□ 

□ 

□ 

□ 

□ 

In 1 *RM 1 

□ 

□ 

□ 

□ 

□ 

□ 

IS1*S1*S1 1 

□ 

□ 

□ 

□ 

□ 

□ 

IN 1 *AFl 

□ 

□ 

□ 

□ 

□ 

□ 

IS2*S2*S2I 

□ 

□ 

□ 

□ 

□ 

□ 

IN2.S1 1 

□ 

□ 

□ 

□ 

□ 

□ 

IRM.RM.RMI 

□ 

□ 

□ 

□ 

□ 

□ 

IN2+S2I 

□ 

□ 

□ 

□ 

□ 

□ 

lAF+AF.AFl 

□ 

□ 

□ 

□ 

□ 

□ 

lN2*RMl 

□ 

□ 

□ 

□ 

□ 

□ 








lN2*AFl 

□ 

□ 

□ 

□ 

□ 

□ 








IS1 *S2l 

□ 

□ 

□ 

□ 

□ 

□ 









Fig. 3d Second revision of the permitted math model after removal of square terms of N 1, N 2, 51, and 52. 
(second revision is used as the upper bound for both the original and simplified search algorithm) 
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NUMBER OF TERMS = 19, 22, 18, 17, 10, 24 


HIERARCHICAL: R1, R2, R3, R4, R5, R6 



(HIERARCHY 

ANALYSIS 

USES 

lF*Gl = lFl*lGl, 

|E*F*Fl= iFh 

IFI.lFl, 

IF1«IF 

= F*F) 



R1 

R2 

R3 

R4 

R5 

R6 


R1 

R2 

R3 

R4 

R5 

INTERCEPT 
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□ 
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□ 
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N1 
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□ 
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INI l*S2 
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□ 

□ 

□ 
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□ 

□ 

□ 

□ 

□ 
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□ 

□ 

□ 

□ 
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□ 
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□ 
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□ 
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□ 

□ 

□ 

□ 


lN2l*RM 

□ 

□ 

□ 

□ 

□ 

N1*RM 


□ 


□ 

□ 


lN2l*AF 

□ 
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□ 
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□ 

□ 

□ 
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□ 

□ 

□ 
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□ 




□ 


IS1 l*RM 

□ 

□ 

□ 

□ 

□ 

N2*S2 

□ 


□ 


□ 
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Fig. 4a Optimized math model obtained after application of original search algorithm, 
(regression model search was completed after ss 28 minutes of CPU time) 
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Fig. 4b Load residuals for optimized math model of original search algorithm. 
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Fig. 5a Optimized math model obtained after application of simplified search algorithm, 
(regression model search was completed after « 3 minutes of CPU time) 
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Fig. 5b Load residuals for optimized math model of simplified search algorithm. 
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